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Abstract 

We will show the usefulness of the tools of Symplectic and Presymplectic Geometry and 
the corresponding Lie algebraic methods in different problems in Geometric Optics. 

1 Introduction: Symplectic and Presymplectic geometry 

Geometric techniques have been applied to physics for more than 50 years in many different ways 
and they have provided powerful methods of dealing with classical problems from a new geometric 
perspective. Linear representations of groups, vector fields, forms, exterior differential calculus, 
Lie groups, fibre bundles, connections and Riemannian Geometry, symmetry and reduction of 
differential equations, etc., are now well established tools in modern physics. Now, after more 
than twenty years of using Lie algebraic mehods in Optics by Dragt, Forest, Sternberg, Wolf and 
their coworkers, we aim here to establish the appropriate geometric setting for Geometric Optics. 
Applications in computation of aberrations for different orders will also be pointed out. 

The basic geometric structure for the description of classical (and even quantum) systems is 
that of symplectic manifold. A symplectic manifold is a pair (M, oj) where uo is a nondegenerated 
closed 2-form in M. If uo is exact we will say that (M, uo) is an exact symplectic manifold. Let 
Co : X(M) -> A\M) be given by uo(X) = i(X) uo, ub(X)Y = uo(X,Y). The two-form uo is said to 
be nondegenerate when uo is a bijective map. Then M is evendimensional and it may be used to 
identify vector fields on M with 1-forms on M. Vector fields Xh corresponding to exact 1-forms 
dH are called Hamiltonian vector fields. The 2-form uo is said to be closed if duo = 0. 

The simplest example is R 2n with coordinates (g 1 , . . . , q n ,Pi, ■ ■ ■ , p n ) endowed with the constant 
2-form uo = J2i=i dq l Adpi. Closedness of uo is very important because Darboux theorem establishes 
that for any point u G M there exists a local chart (U, <fi) such that if <j) — {q l , ■ ■ • , q n ', Pi, ■ ■ ■ ,Pn), 
then uo\u = J2i=i dq l A dpi. Consequently, the example above is the local prototype of a symplectic 
manifold.lt is also well known that if Q is the configuration space of a system, its cotangent bundle, 
T*Q = Uq£QT*Q, called phase space, is endowed with a canonical 1-form 6 on T*Q such that 
{T*Q, —d9) is an exact symplectic manifold. More especifically, if (q 1 , . . . , q n ) are coordinates in 
Q then (q 1 , . . . , q n ,pi, . . . ,p n ) are coordinates in T*Q and 9 = Y%=iPi dq l , w = Y^=\ dq l A dpi. 

A Hamiltonian dynamical systems is a triplet (M,uo, H) where M is a differentiable manifold, 
uo G Z 2 (M) is a symplectic form in M and H G C°°(M) is a function called Hamiltonian. The 



dynamical vector field Xh is then the solution of the equation i{Xh)oj = dH. In the example 
above 
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and the same expression is valid in Darboux coordinates for any Hamiltonian dynamical sys- 
tem. A Poisson bracket can be introduced in a symplectic manifold (M, u) by {F, G} = XqF = 
lv(Xf, Xq). Then, closedness of u is equivalent to Jacobi identity for P.B. Moreover, it may be 
shown that a = Cu~ l o d : C°°(M) — > X(M) is a Lie algebra homomorphism. 

A presymplectic manifold is a pair (M, a;) such that M is a differentiable manifold and u is 
a constant rank closed 2-form in M. The kernel of oo defines an integrable distribution (because 
duj = 0) and when the set of leaves is a manifold, it can be endowed with a symplectic structure. 
This process is called reduction of the presymplectic structure. 

Very interesting examples of HDS are those defined by regular Lagrangians, (TQ,u>l, El), 
with ojl = —d0L = —d(dL o S), El — AL — L. More accurately, the geometric approach to the 
Lagrangian description makes use of the geometry of the tangent bundle of the configuration space 
that we will shortly review. The tangent bundle tq\TQ — > Q is characterized by the existence 
of a vector field generating dilations along the fibres, called Liouville vector field, AgX(TQ), and 
the vertical endomorphism which is a (1, l)-tensor field S in TQ that in a natural coordinate 
system for TQ, induced from a chart in Q, are A = v l d/dv\ and S = (d/dv l ) <g> dq l . Given a 
function L e C°°(TQ), we define the 1-form 9 L G f\ l {TQ) by L = dLoS. When the exact 2-form 
ujl = —d&L is nondegenerate the Lagrangain L is called regular and then (TQ,ujl) is a symplectic 
manifold. The energy function El is given by El = A(L) — L. The coordinate expressions are 
6 L = (dL/dv^dq* and E L = v^dL/dv*) - L. 



2 Symplectic structures in Geometric Optics 

The set of oriented geodesies of a Riemannian manifold can be endowed with a symplectic structure 
and in particular the set of oriented straightlines in the plane, which is the set of light rays in a 
two-dimensional constant rank medium, can be endowed with a symplectic structure. Moreover, 
it can be considered as the cotangent bundle of the one-dimensional sphere S 1 . If an origin O 
has been chosen in the plane, every oriented straightline that does not pass through the point O 
is characterized by a unit vector s pointing in the line direction and a vector v orthogonal to s 
with end on the line and origin in O. The straightlines of a pencil of oriented parallel lines are 
characterized by proportional vectors v and the same s. Straightlines passing through O with 
direction given by s correspond to v = 0. The vectors v and s being orthogonal and s • s = 1, the 
couple (s, v) can be seen as a tangent vector to the unit circle S 1 at the point described by s. 

The Riemannian metric in S 1 can be used to identify in each point s the tangent space TgS 1 with 
its dual space T^S 1 and therefore the tangent bundle TS 1 with the cotangent bundle T*^ 1 . This 
identification shows us that the space of oriented straightlines in the Euclidean two-dimensional 
space can be endowed with an exact symplectic structure which corresponds to the canonical 
structure for the cotangent bundle T*5' 1 . The study of oriented straightlines in Euclidean three- 
dimensional space follows a similar pattern. 

A choice of coordinates in the base space will provide us Darboux coordinates: a good choice 
will be an angle coordinate. A straightline y = mx + b with slope m = tan 9 will be represented by 



a vector orthogonal to the vector s = (cos 9, sin 9), and length b cos 9, namely, v = bcos9 J|. The 
vector d/d9 is unitary in the Euclidean metric, and then the point (9,pe) G T*S X corresponding to 
(9,ve) is pe = vg. The symplectic form in TS l translated from the canonical symplectic structure 
in T*^ 1 ujq = d9 A dpg will be u = d9 A d{b cos 9) = d(sin9) A db. Therefore, Darboux coordinates 
for oj adapted to the cotangent structure are not only (9,b cos 9) but also q = sin 9, p = b, which 
are more appropriate from the experimental viewpoint. So, the flat screens arise here as a good 
choice for Darboux coordinates. 

The choice usually done in Geometric Optics is s-s = n 2 , the Darboux coordinate q then being 
q = n sin 9. This leads to the image of the Descartes sphere, a sphere of radius n whose points 
describe the ray directions. In the more general case of a variable refractive index, we recall that 
light rays trajectories in Geometric Optics are determined by Fermat's principle: the ray path 
connecting two points is the one making stationary the optical length: 5 J^nds = 0. 

This corresponds to the well-known Hamilton's principle of Classical Mechanics with an "op- 
tical Lagrangian" L = n + v 2 + v 2 , which is a differentiable function in TM 3 up to the zero 
section. In other words, the mechanical problem corresponding to Fermat's principle leads to a 
singular Lagrangian L(q,v) = [g(v , v )] 1//2 , where g is a metric conformal to the Euclidean met- 
ric g , g(v,w) = n 2 g (v,w). L is an homogeneous function of degree one in the velocities and 
consequently L is singular and the corresponding energy function vanishes identically. 

It was shown in |I] that it is possible to relate the solutions of the Euler-Lagrange equations 
for L with those of the regular Lagrangian L = |L 2 , up to a reparametrization. L is quadratic in 
velocities and the solution Tj^ of the equation i(TjJuj^ = dE-^ = dh is not only a second order 
differential equation vector field but also a spray, the projection onto M 3 of its integral curves being 
the geodesies of the Levi-Civita connection defined by g. The kernel of u>l is two-dimensional 
and it is generated by and the Liouville vector field A. The distribution keruiL is integrable 
because duji = 0; the distribution is also generated by A and K = -gFy , for which [A, K] = 0. 

If the refractive index for an optical system depends only on x 3 and the region in which the 
index is not constant is bounded, we can choose Darboux coordinates by fixing a x 3 outside 
this region and taking Darboux coordinates for the corresponding problem of constant index [[J. 
This justify the choice of coordinates for the ingoing and outgoing light rays in the constant index 
media, i.e. it shows the convenience of using flat screens in far enough regions on the left and right 
respectively, and then this change of Darboux coordinates seems to be, from an active viewpoint, 
a canonical transformation. Similar results can be obtained (see |fl) for nonisotropic media for 
which the refractive index depends only on the ray direction, i.e. n = n(v) and An = 0. The only 
difference is that u-^ may be singular, but in the regular case all works properly. 

3 Group theoretical approximations 

Mathematical expressions like x' = f(x) admit two different interpretations. In the aiias inter- 
pretation x and x' are coordinates of the same point in two different coordinate systems, while 
in the ad libi interpretation x are the coordinates of a point and x' those of its image under 
the transformation defined by /. In this sense a change of Darboux cordinates can be seen as 
a canonical transformation in M 2n , and in particular, when Darboux coordinates are chosen as 
indicated above, the passage of the set of light-rays through an optical device can be considered 



as a canonical transformation. Moreover, we can split an optical system in two subsystems and 
the canonical transformation factorizes as a product of two canonical transformations. Even if the 
group of canonical transformations is not a Lie group, any element g can be written as the expo- 
nential of an element in its Lie algebra, the set of Hamiltonian systems, g = expX/. Symmetry 
of the optical system leads to reduction, and then to a lower number of degrees of freedom. 

There exist formulae generalizing Baker-Campbell-Hausdorff for composition of generating 
functions, both in an exact or approximate way. Most of approximation formulae substitute the 
generating functions by a power series development and then only keep some terms, giving rise in 
this way to aberrations. For instance, if we only consider quadratic terms, we will get the linear 
approximation. 

The fundamental algebraic ingredients for the theory of approximate groups are the concepts 
of enveloping algebra U and symmetric algebra E of a Lie algebra g. Essentially, if {X 1 , . . . , X n } 
is a basis of q, then E is the algebra of polynomials in {Xi, . . . , X n }. Both U and E have 
graded Lie algebra structures extending that of q, which can be identified as a subalgebra of U 
and E. In the same way as g can be seen as the set of linear functions on q*, the symmetric 
algebra can be considered as the set of polynomials on g*- The adjoint action of G on q can 
be extended to an action Ad : G x E — > E in such way that Ad(g) is linear for each g G G 
and Ad(g)(pi.p 2 ) = Ad(g)(pi).Ad(g)(p 2 ). The extension of the adjoint action of the Lie algebra, 
ad : g x g — > g, ad(a, b) = ad(a)(fe) = [a, b], is the adjoint action of the symmetric algebra E: 

ad : E x E -> E, ad(pi,p 2 ) = ad(pi)(p 2 ) = [Pi,P2]s- 

For any p G E, we can also consider the formal transformation of E, (p(p) : E — > E, 

p' -> 4>{p){p') = exp(ad(p))(p') =p' + \p,p% + l/2[p, [p,p'] E ]s + • • • , 

and we should now consider the elements p for which such expresion is meaningful. They span a 
group Gs- The enlarged action of it reduces to the identity when acting on the set E 7 = {p G E | 
[a,p]s = 0, Va G g} of the polynomial Casimir elements of g. We shall then pass to the quotient 
graded Lie algebra E c = E/E 7 . Finally for approximation we will consider for each r G N the 
ideal spanned by H r = t>r E t , and then <f>(\p])H r C H r , and therefore it induces a map $ r ([p]). 

4 Perturbative treatment of aberrating optical systems 
using Weyl group 

A model for geometrical optics in a plane is obtained from the Weyl group W(l): it is a three- 
dimensional Lie group, with elements g G W(l) labelled by g — ^ G M 2 , v G and 
composition law g'g = (/ti + fi[, u' + v + A fii) where A denotes (a, b) A (c, d) = ad — be. 

A basis for the Lie algebra tt>(l), is given by Q = d a — \bd u , P — — \ad v , I — d v with Lie 
brackets [Q, I] = [P, I] = 0, [Q, P] = I . An infinite-dimensional basis for the associated symmetric 
algebra E is given by {1, 7, Q, P, I 2 , IQ, IP, Q 2 , QP, P 2 , . . .}. <j>(\J + \ 2 Q + \ 3 P) is in fact an 
element of G. Its generalized adjoint action on E preserves each subspace E r . It is enough to 
know its action on Ei, (oil + c<iQ + c^P — > ((ci — A3C2 + A2C3)/ + c 2 Q + C3P). Another typical 
element of Gs is given by 0(^i/ 2 + v 2 IQ + v?,IP + V4Q 2 + v^QP + vqP 2 ), which in fact is only a 



formal map. However, its projected maps on P r are well defined. The infinitesimal adjoint action 
ad(z/i/ 2 + V2IQ + + U4Q 2 + v^QP + uqP 2 ) maps each E r onto E r+ i, so that only its action 
on Ei is not trivial when considering its projected map on P 2 =E © Si © E 2 . 

The Casimir elements of tu(l) are the polynomial functions on I and 1. A basis for the quo- 
tient algebra E c is {[l] c , [Q] c , [P] c , [Q 2 ] c , [QP] C , [P 2 ] c , . . .}. The reduction process can be obtained 
by quotient by the ideal generated by the Casimir /. A typical element of the group Gj^c is 
4> c {vi[Q 2 ] c + V2lQP] c + ^[P 2 ] ). Its projected action on P^ is given by the matrix 



<i> c (MQ 2 } c + MQP} c + MP 2 } c ) = 
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where M G SL(2,M) and D 2 (M) is the image of M in the three-dimensional representation: 
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The matrix M associated to the map c (^i[Q 2 ] c + ^[QP] + ^[-P 2 ] ) is given by 
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with iv 



= ±\jv\ - 4z/iZ/ 3 . 



A matrix representation for the group G^ provides a perturbative treatment of (r — l)-th 
order (in 1+1 dimensions). Using the factorization theorem and the axial symmetry of the system, 
only transformations of type (p c ([p] c ) with [p] c G E^ or E^ must be considered for third order 
aberrations. <fi c (vi[Q 2 ] c + ^[QP] + ^[-P 2 ] ) has a matrix representation immediate generalization 



of the above mentioned on P 2 . The element 4 (/ii[<5 4 ] c + /U 2 [Q 3 -P] C 
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The representation splits into two, acting respectively on the even and odd degree subspaces. The 
fisrt one can be used to find the composition law and the second one can be used to find the 
approximate coadjoint action on the coordinate functions q and p. 

The free propagation, till the third degree approximation is p' = p, q' = q + ^p + ttsP 3 , and 
the group element will be of the form 0([p 2 ] c ) °0([p4] c ), with [pi] c G The matrix M G SL(2, M.) 

10" 



associated to 0([p 2 ] c ) is M 



z/n 1 



while [p 4 ] c = --^a[P A } c , so that, {^1,^2,^3,^4,^5} 



{0, 0, 0, 0, -fs} determines M x and M 2 . 



The same calculus for a refracting surface gives 

/ i m-m \ 3n 2 — m— 2^ 1 — m 

M=( f ).and^{ SR3 - ,-^,^,0,0}. 



5 Example 

In the design of a doublet we have seven of these basic systems concatenated so that the total 
system (chosen to be telescopic) is obtained in third order approximation by composition formulae 
of the corresponding third order aberration group. The composition of the systems (M 2 ,fi 2 ) 
and (Mi, fix), with Mj the linear approximation matrices and /ij the coefficients of the fourth 
order polynomials, is obtained by the formula (M 2 Mi, D 4 (M 1 _1 )/x 2 + /ii), D 4 (M) being the former 
representation of matrix M on the fourth order polynomial space. In our example, a concatenation 
of compositions for the doublet gives way to a total linear approximation matrix (on which we can 
impose the telescopic condition 7 = and a given factor of magnification, say 5 = 5) and a total 
fourth order polynomial. Fixing the refractive indexes of the lenses as n x = 7/4 and n 2 = 9/4 we 
obtain a seven parameter system of equations. The polynomic expression of fi in terms of Si and 
Zj (the radii and lengths of the lenses) is a set of eleven to sixteen degree polynomials, so that 
numerical calculus should be used to obtain solutions with zero third order aberrations. 

The example does not try to be a realistic design, in which a four dimensional space should be 
used on the q's and p's, chromatic aberration should be taken into account through the dependence 
on the refractive index with the wave length, and stability of the solution to errors on the param- 
eters should be considered. For our playing design system, and in order to simplify the calculus, 
we can fix some of the parameters in terms of the other ones, say S 2 = S3 = zi/4, z% = z\ and 
z 2 = 2zi, so that only three parameters are left free. Taking into account the telescopic condition 
and the given factor of magnification we are left with just one parameter, which can be used to 
minimize the square of the /i vector. 

A numerical solution obtained for this simplified case is Z\ = 0.5925, which gives a linear 
approximations for the total system and a total vector /i to tai 

/ 9 1 75R7 \ 
M total = ( 2 9810 -i 9 5 J - itotai = {0.1295, -1.4124, 1.2882, 0.7881, -2.3739}. 
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